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Abstract 

Power indices are mappings that quantify the influence of the members of a voting body on 
collective decisions a priori. Their nonlinearity and discontinuity makes it difficult to compute 
inverse images, i.e., to determine a voting system which induces a power distribution as close as 
possible to a desired one. This paper considers approximations and exact solutions to this inverse 
problem for the Penrose-Banzhaf index, which are obtained by enumeration and integer linear 
programming techniques. They are compared to the results of three simple solution heuristics. 
The heuristics perform well in absolute terms but can be improved upon very considerably in 
relative terms. The flndings complement known asymptotic results for large voting bodies and 
may improve termination criteria for local search algorithms. 
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1 Introduction 

Collective decision rules and, in particular, heterogeneous voting weights for members of a committee, 
council, or shareholder meeting translate into influence on collective decisions in a nonlinear and even 
discontinuous fashion. This can be seen, for instance, by considering a decision quota oi q = 50% and 
players i G N = {1,2,3} whose voting weights are given by either the vector (i) w = {wi,W2t'W3) = 
(33.3,33.3,33.3), (ii) w' = (50 - e,48 + £,2), or (iii) w" = (50 + e,48 - e,2) for small e > 0. The 
major weight change from w to w' does not affect possibilities to form a winning coalition at all, where 
coalition S' C TV is called winning if the cumulative weight of its members exceeds the quota. Namely, 
S is winning if and only if jS"! > 2. By symmetry, the distribution of influence can a priori be expected 
to equal (^, ^, ^) for either of the voting systems described by (g;w) and {q;w'). The minor change 
from w' to w" , in contrast, renders voter 1 a dictator with associated power distribution (1, 0, 0). 

Social scientists, philosophers and mathematicians have investigated various voting power indices 
which try to quantify the a priori distribution of influence on committee decisions. The Shapley-Shubik 
index ( [Shapley and Shubik 1954] ) and the Penrose-Banzhaf index (PBl) (jPenrose 19461 IBanzhaf 1965^ 
are most prominent, but by far not the only onesQ They help researchers clarify the non-trivial a priori 
power implications of different voting weight assignments to a wider audience. The combinatorial 
nature of weighted voting systems can, still, mislead the general public's intuition and even that of 
political practitioners. For instance, it was apparently not noted that the voting weights of the original 
six members of the European Economic Community, in use from 1958 to 1973, rendered Luxembourg 
a null player whenever the EEC Council applied its qualified majority rule. The public discussion - 
very heated in, but not restricted to, Poland and Germany - in the wake of the 2007 EU summit at 

^ See [Felsenthal and Machover (1998)| or |Laruelle and Valenciano (2008)] for overviews. 



1 



which new voting rules for the EU Council were agreed reflected persistent confusion between voting 
weights and power. 

Even to specialists, the discrete nature of voting rules still poses challenges. This is true, in 
particular, for the optimal design of a voting system. Certain normative desiderata, such as the equal 
representation of bottom-tier voters in a two-tier voting system, often call for a specific distribution of 
voting power. It is then a non-trivial exercise to find a deterministic voting rule that comes as close as 
possible to inducing the desired a priori power distributionlfl Simple gradient-like search algorithms, 
such as the ones considered by Leech (|2002[ |Aziz et al. (2007)[ or |Fatima et al. (2008)] deliver 



excellent results for many instances of this so-called inverse power index problem but have never been 
evaluated formally. One can neither rule out that only a local minimum of the distance between the 
desired and the induced power vector has been identified. Nor are bounds known on the possible gap 
to a globally optimal voting rule. The latter might involve the intersection of several one-dimensional 
{q; w)-rules and, therefore, need not even be a feasible result of the applied search algorithm. 

Motivated by qualified majority voting in the EU, Slomczyhski and Zyczkowski (|2006[ I2007P have 
identified an elegant way to approximately solve the inverse problem for moderately big n if the 
decision quota g is a free parameter!^ Their heuristic suggestion is particularly appealing because it 
avoids discrepancies between voting weight and power. Namely, approximate proportionality between 
the normalized weight vector w — {wi, . . . ,z/;„) and the induced PBI B(q;w) is achieved when the 
quota is set to g* = ^(1 + A desired power vector /3 can hence approximately be induced 

simply by choosing w ~ (3 and then calculating q* . Because the rule {q* ; w) is simple and minimizes 
possible confusion between voting weight and power, it has motivated the prominent "Jagiellonian 
Compromise" (also known as double square root voting system) in the discussion of future voting rules 
for the EU Council (see, e.g.. lKirsch et al. 2007)) . 

Whether the decision quota q is a free parameter, so that Slomczynski and Zyczkowski's optimal 
q* indeed can be chosen, or not, depends on the application at hand. Even if it can, the lack of 
bounds on how well the (g*; /3)-heuristic performs relative to the respective globally optimal solution 
to the inverse problem provides motivation for further research. Knowing more about the quality of 
the (g*; /3)-heuristic is especially important for situations in which the heuristic can be expected to 
perform rather badly. The derivation of g* is based on a continuous approximation of the fundamentally 
discrete distribution of the cumulative weight of a random coalition. Its use is problematic when this 
approximation is inaccurate. This pertains especially - but not exclusively - to "small" voting bodies. 

For a given number n of players, the set of different binary voting systems or simple games is finite. 
On the one hand, this finiteness implies that many desired power distributions cannot be approximated 
too well. Nontrivially, this remains true even for large n: Alon and Edclman's (|2010p results imply 
that there is a sequence of desired power distributions {/3"}ri=i,2.... which stays at least a constant 
positive distance away (in the || • ||i-norm) from any Pcnrose-Banzhaf power distribution. As shown 



Kurz (2012b) the desired power distributions /3" = (0.75, 0.25, 0, 0, . . . , 0) have |1 • || i-distance of at 



least ^ to the PBI of every complete simple game or weighted voting game for 2 < n < 16 players. 

On the other hand, the finiteness of the set of simple games suggests a trivial algorithm for solving 
the inverse problem: enumerate all systems v with n players, compute the respective power distribu- 
tion - say, the PBI B{v) - and then pick a game v* that induces the smallest achievable difference 
between ideal vector /3 and B{v) according to a suitable metric. 

To this end, a growing literature has investigated methods for the efficient enumeration of voting 
systems (see, e.g., [Keijzer 2009) [Keijzer et al. 2010 IKurz 2012a|) . But, up to now, even the number 



of complete simple games (and also of weighted voting games) is unknown for n > 9. So enumeration 
works only for voting bodies with few members. Exact solutions to the inverse problem can, fortunately, 
also be obtained for somewhat larger n by integer linear programming (ILP) techniques. Such an 



^Non-dctcrministic rules such as random dictatorship or random quota rules l |Dubey and Shapley 1979] sec. 5) can 
easily solve the problem, but are generally not regarded as satisfactory. 

•^For very big n, except in rather pathological cases, the distinctions between voting weight and voting power become 
negligible. Limit results for n — > oo which render the inverse problem trivial date back to the seminal work by Lionel S. 
Penrose 119461 11952I I , and have rigorously been investigated by [Lindner and Machover (2004) [ [Chang et al. (2006) [ and 
[Lindner and Owen (2007) [ 
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approach was recently presented in Kurz (2012b) 
of ILP to electoral systems, as discussed in Grilli di Cortona et al 



Ricca et al. (2012) 



It stands in the tradition of earlier applications 
(1999)1 



Pennisi et al. (2007) 



This paper draws on both methods - complete enumeration and ILP - as well as standard local 
search algorithms in order to evaluate the accuracy of three heuristic solutions to the inverse problem 
for the PBI. The first heuristic si mply combines w = with q° = 50% ; the second combines it with 
the "optimal quota" q* derived by Slomczyhski and Zyczkowski (2007) the third uses 9 = 5 + l/^/nn. 
The latter quota is the average of q* computed over a set of vectors which is of part icular interest 
for the egalitarian design of two-tier voting systems (Slomczyhski and Zyczkowski 20111. 

We compute differences between the respective heuristic solution and the exact solution for three 
different metrics and a comprehensive grid of target vectors with up to 71 = 7 voters. We also study 
approximations of the exact solutions for a large sample of grid points for 7 < n < 15 as well as se- 
lected real-world examples based on the so-called Penrose square root rule and EU population figures. 
The results allow the estimation of bounds (and termination criteria) for the accuracy of a candidate 
solution which has been obtained by conventional local search methods. This may be useful in applica- 
tions where a specific voting power distribution is sought for a moderate number of council delegates, 
committee members, or business shareholders. We also analyze the significant magnitude of error that 
the mentioned heuristics can produce even for large n in pathological cases. 

In the following Section [5] we introduce binary voting systems and their basic properties. The 
inverse power index problem is formalized in Section [31 along with a brief discussion of worst-case 
bounds. In Section S] we present the design of our comparative investigation. The corresponding 
results are the topic of Section [5] We conclude in Section [6) 



2 Binary voting systems 

We consider binary voting systems, i.e., each voter i £ N := {1, . . . ,n} casts a binary vote (e.g., "yes" 
or "no") and this determines a binary collective decision. Such a situation can mathematically be 
described by a Boolean function v: 2^ ^ {0, 1}, where 2^ denotes the set of subsets of N . A coalition 
S C N can, e.g., be interpreted as the set of "yes" -voters for a particular (unspecified) proposal. 

Definition 1 A simple game is a monotone Boolean function, i.e., a mapping v: 2^ — > {0,1} with 
v{S) < v{T) for all 5 C T, which additionally satisfies w(0) = 0, v{N) = 1. 

Coalition 5* C is called winning if v{S) = 1, and otherwise losing. S ^ N is called a minimal 
winning coalition if it is winning and all proper subsets are losing. Similarly, a maximal losing coalition 
is losing and all its proper supersets are winning. A simple game is uniquely characterized by either 
its set of minimal winning coalitions or its set of maximal losing coalitions. For a proof and additional 



properties we refer the reader, e.g., to Taylor and Zwicker (1999) 



As an example consider the simple game v which is characterized by the set {{1}, {2, 3}} of its mini- 
mal winning coalitions. Taking all supersets of the minimal winning coalitions yields {{1}, {1, 2}, {1, 3}, 
{1, 2, 3}, {2, 3}} as the set of ah winning coalitions. The remaining subsets are losing, with the set of 
the maximal losing coalitions given by |{2},{3}}. 

The monotonicity imposed in Definition [I] is a very weak requirement. By introducing Isbell's 
desirability relation, i.e., i Zi j for two voters i,j € A^ if and only if v{{i} U >S'\{j}) > v{S) is satisfied 
for all {j} C S* C A^\{i} (see, e.g.. Ilsbell 1956p . one can define a particularly relevant subclass of the 
set of all simple games S: 

Definition 2 A simple game v is called complete (also called dircctedj if the binary relation □ is a 
total preoder, i.e. 

(1) i Z i for all i e N, 

(2) i j or j □ i (including "i □ j and j □ i") for all i,j G N , and 
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(3) i j, j □ h implies i Z\ h for all h e N. 



In our small example one can easily check that 1d2d3. SowgC where C C S denotes the set 
of all complete simple games. Note that also 3 □ 2, i.e., voters 2 and 3 are equally desirable. 
Many binary voting systems which are used in practice belong to a further refinement of S: 



Definition 3 A simple game v is weighted if there exist non-negative weights Wi 
quota q G IR>o such that v{S) ~ 1 if and only if X^ies — 1- 



>o and a positive 



A weighted representation of our small example is given by (g; w) = [2; 2, 1, 1]. We call weighted 
simple games weighted voting games and denote their collection by W. Every weighted voting game 
is complete while not every complete simple game is weighted, i.e., W C C C S. But each complete 
simple game and even each simple game can be represented as the intersection of 1 < fc < c» weighted 
voting games. The minimal number k of weighted voting games is called the dimension of the simple 
game in question (see, e.g., [Taylor and Zwicker 1999[ |Demeko and Woeginger 2006[ ). The presently 
known enumeration results for the three considered classes of binary voting systems are summarized 



in Tablc[l](up to isomorphisms). See, e.g., Kurz (2012a) for details 



n 


1 


2 


3 4 


5 


6 


7 


8 


9 




1 


3 


8 28 


208 


16351 


>4.7- 10*^ 


>1.3- 10^* 


> 2.7- 10^6 


#c 


1 


3 


8 25 


117 


1171 


44313 


16175188 


284432730174 


#w 


1 


3 


8 25 


117 


1111 


29373 


2730164 


989913344 



Table 1: Number of distinct simple games, complete simple games, and weighted voting games 



There are several equivalent representations of binary voting structures besides Boolean functions 
and lists of, e.g., minimum winning coalitions. Simple games can, for instance, be described as in- 
dependent sets in a graph, and Carreras and Freixas (1996) have introduced a very efficient matrix 
parameterization of C. Our computation algorithms will exploit yet another possibility. Namely, we 
use that voting systems can be represented as points of a polyhedron which have integer coordinates 
only: for each S ^ N define < xs < 1 and add the constraints X0 = 0, xat = 1, and xs < xt for all 
<l) C S C T C N. Each integer solution x £ {0, 1}^ of this system of linear inequalities is in bijection 
to a simple game (with v{S) = xs)- Complete simple games and weighted voting games are described 
by additional constraints and auxiliary variables for the weights. This allows the investigation of all 
three classes of voting games by ILP techniques. 



3 The inverse power index problem 

Power indices are mappings from a set of feasible voting structures, such as S or W, to non-negative 
real vectors which are meant to quantify the influence of the members of a voting body on collective 
decisions. The inverse power index problem consists in finding a voting system, e.g., {q; w) € W, which 
induces a power distribution as close as possible to a desired one. More formally, for a given number 
n of voters, the general inverse power index problem involves a set F of feasible voting structures for 
n players, a power index (/>: F ^>0' ^ desired power distribution /3 £ R"q with X]"=i A = I7 and 
a metric d: M" x M" — R>o which measures the deviation between two power vectors. Of course, 
d{x,y) — \\x — y\\ is a suitable choice for any vector norm || • ||. Given these ingredients the inverse 
power index problem amounts to finding a solution to the minimization problem 

mmd{^{v),P). (1) 

In this paper, we consider the special instance of this problem where F G {S,C,W}. We include 
S and C because they are significantly larger domains for n > 5 (see Table [T]) and some prominent 
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real- world electoral systems fail to correspond to weighted voting games. Examples include the current 
voting rules (Treaty of Nice) and the future ones (Treaty of Lisbon) for the EU Council, which are 3- 
and 2-dimcnsional, respectively. We take the (normalized) Pcnrosc-Banzhaf index B{v) as the voting 
power index of interest. 

Definition 4 For a given n-player simple game v the absolute Penrose-Banzhaf index B[{v) for player 
i is defined as 

B-iv)^^- E v{SU{^})-v{S). 

0CSCAr\{i} 

The (normalized) Penrose-Banzhaf index (FBI) Bi{v) for player i is defined as 

Bliv) 



B,iv) 



ruB^ivY 



Our distance computations will be based on the || • ||i-norm (i.e., the sum of deviations between 
Bi{v) and j5i for all players i), the || • ||cc-norm (i.e., the maximum deviation), and a weighted version 
of the former. Section U] will provide more details. 

To the best of our knowledge, there exists only one (non-trivial) non-approximative result on how 
well the inverse problem can be solved for the PBI in the worst case. It is useful for later reference to 



rephrase this rather recent finding by Alon and Edelman (2010) 



Theorem 1 (Alon-Edelman) Let n > k be positive integers, let e < be a positive real, and let 
V be a simple game with n voters. If '^'^^f^^i B{v,i) < e, then there exists a simple game v' with k 
voters such that 



i=l i=fe+l ^ ' 



So given a "large" game v with n players in which 1 — e of the total PBI (normalized to one) 
is concentrated amongst k < n "major" players, it is possible to ignore the n — k "minor" players, 
i.e., compute the PBI in the smaller game v' amongst the major players only, and make an error (in 
the II • Ill-norm) of no more than the stated bound. As an example, consider the power distribution 
/3" = (0.75, 0.25, 0, 0) e ]R>Q for n > 2 and choose k = 2, e = jg. Let v he a. simple game 
involving n players with X]i"=3 Bi{v) < s = j^, From Theorem [T] we conclude the existence of a simple 
game v' involving fc = 2 players with \\B{v) — B{v')\\i < ^^^!^^]f ^ ~ JE (^itli B{v') extended 



-{k+l)6 

P and 

vectors B{v') G {(1,0), (i, i), (0, 1)} is at least i. We, therefore, have 



naturally from ]R?.q to K>o)- The || • ||i-distancc between and the three possible 2-playcr PBI 



\\Biv) -n\i> \\Biv') - /?^||i - \\B{v) B{v')\\, > i - 1 = 1. 

If otherwise Er=3^»(«) > ^ t^cn \\B{v) - ^"||i > (l - Bi{v) - B2{v)) + e > 2e = i. Hence, /3" 
cannot be approximated by the PBI of a simple game with an || • ||i-crror less than ^. The latter is the 
sharpest possible bound obtainable from Theorem [TJ It can be improved computationally to slightly 
more than i| for n < 11 on S and for n < 16 on C and W fsee lKurz 2012bp . 



4 Design of the computational investigation 

When the inverse problem arises in political applications of constitutional design, PBI vectors {3 which 
are proportional to the square root of a population size vector p play an elevated role. The reason is 
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that - under the probabihstic assumptions which underhe the PBI - a binary voting system v with 
B{v) = /3 and 



would cquahze the voting power of citizens in a two-tier system in which n delegates adopt the bottom- 
tier majority opinion of their respective constituency i € {1, . . . , n} and then cast a w^-weighted vote 
in a top-tier assembly (e.g., the EU Council). See Penrose (1946)[ Felsenthal and Machover (1998)[ 



Kaniovski (2008) or Kurz et al. (2012) for details. We will consider this Penrose square root rule for 
varying n and some historical population figures in order to select target vectors /3 in our computations. 

In principle, however, any vector in K"o whose entries sum up to 1 might be a desired power 
distribution /?. For instance, the partners of a non-profit R&D joint venture might have made relative 
financial contributions of (^j f) and possibly want to align a priori voting power to this vector 

as well as possible. Ideally, for a given number n of voters, one would like to compare the exact and 
heuristic solutions to the inverse problem for all possible normalized target vectors /3 S A(?t, — 1), 
where A(n— 1) denotes the n~ 1-dimensional unit simplex. Because this is computationally infeasible, 
we complement our analysis of some politically motivated vectors by vectors /3 from a finite grid on 
A(n — 1). And we resort to approximations of the exact solution when n is too large. 

We will compare the (approximated) exact solution of the inverse problem on domain S, C, or W 
for a given desired PBI (3 with three different heuristics. They stay in the class W of weighted voting 
games and have in common that voting weights are set equal to the desired voting power, i.e., w = p. 
They pick a distinct quota, and hence typically a different voting system w G W. 

The first heuristic - referred to as the 50%-heuristic - just chooses q° = ^. Simple majority is 
arguably the most common majority rule in practice. The 50%-heuristic simply picks it and ignores 
the potentially large discrepancies between voting weight and voting power that can arise. 



The second, more sophisticated heuristic has been suggested by Slomczyhski and Zyczkowski 
I2007[) . Their motivation was to implement PBI vectors proportional to the square root of population 
sizes in the European Union, but the heuristic applies to arbitrary target vectors. Namely, the q*- 
heuristic selects the quota 

1* = l(^ 




for an arbitrary UI = /? G A{n—1). Slomczynski and Zyczkowski (2007) derive this quota by considering 



the random weight W which is accumulated if all coalitions 5 C TV are equiprobable, as the PBI's 
probabilistic justifications suppose. Equiprobability at the level of coalitions is equivalent to assuming 
that each voter i € {1, . . . , n} joins the formed coalition independently of the others with probability 
i. The mean of W hence is ^ = 'l27=i = ^ its variance is cr^ = -j Being the sum of 

independent bounded random variables, W is approximately normally distributed if n is sufficiently 
large and each of the weights is sufficiently smallo Assuming that this is the case and, therefore, that 
the discrete random variable W can be replaced by the continuous one W, the inflection point of the 
corresponding normal density of W is located at g* ~ n + a. Since the second derivative of W^s density 
vanishes at q* , one can approximate the density by a linear function with reasonably high accuracy. 
This lin ear approximation then allows to esta blish approximate proportionality of B{q*;w) and w. We 



refer to 



Slomczynski and Zyczkowski (2007) 



for details. 



Our final heuristic, which we will refer to as the q-heuristic, replaces q* by 

1 1 



^ 2 

This quota approximates the expected value of q* when /3 is proportional to the component-wise square 
root of a population size vector p = (pi, . . . ,pn) which is drawn from a flat Dirichlet distribution (see 



Slomczynski and Zyczkowski 2011). The motivation for computing such an average is the following: 



*A key technical requirement is maxj wj <C y '' 
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even though the (j'*-heuristic can approximate the Penrose square root rule ([2|) very transparently for 
a given population distribution p, frequent changes in the population would call not only for frequent 
changes of the heuristic's preferred voting weights w but also of the quota q* . That current voting 
weights in the EU already recur to population figures, which arc updated on an annual basis, suggests 
that weight changes may be regarded as unproblcmatic. A varying decision threshold - perhaps 
q — 65% in one year, q' = 61% in the next, then q" = 67%, etc. - however seems politically less 
palatable. It may then make sense to average q* over a wide range of values for w = 13 oc and 
the heuristic simply assumes that all population distributions p G A(n — 1) are equally likelyH 
Because g — > i as n — >■ oo, the 50%-heuristic is the limit of the heuristic and can be viewed as an 
approximation of it for not too small n. 



Let us remark that investigations by Kurth (2008) have called attention to numerical problems 



when heuristics which involve irrational voting weights and quotas, as the q* or g-heuristics commonly 
do, are implemented. Rounding after, e.g., 4 decimal places can result in voting systems which differ 
significantly from what was intended. Because it is impractical to deal with weights of a hundred 
decimal places or more, it is attractive to work with the underlying Boolean functions or integer points 
of a suitable polyhedron as long as possible, and to determine minimal integer weights w and a quota 
q which jointly represent a given w G W when needed^ We use this approach here whenever possible. 



and refer the interested reader to Freixas and Molinero (2009) or Kurz (2012a) 



We calculate the globally optimal solution to the inverse problem for a given target PBI /? by 
complete enumeration of the elements in the respective class of binary voting systems for n < 7 (see 
Table [1]). For larger n, we mostly focus on approximations of the exact solution. These are obtained 
either by a fast local search algorithm or, preferably, by ILP techniques. How the latter are used is 
explained in the Appendix. The implemented ILP-based algorithm in principle yields globally optimal 
solutions but is interrupted for computation time reasons when a desired precision has been reached. 
The key idea is to consider the integer polyhedron which contains all simple games whose PBI is less 
than a given factor a > away from the desired vector /?. If this polyhedron is empty, no such game 
exists and a needs to be raised. If not, a can be lowered. The minimal level of a or an approximation 
with desired precision, together with the corresponding voting systems, can thus be found by the 
bisection method. 

In evaluating the quality of the mentioned heuristics, we consider distances to the desired power 
vector, /?, and to the globally optimal one, B{v*), in three different metrics. The first one is the metric 
c?i {x, y) := \\x — y\\i = ^^^=1 ~ Vi I induced by the || • || i-norm, which is also considered in Theorem[T] 
The second is the metric induced by the || • ||oo-iiorm, i.e., doc,{x,y) := \\x — y\\cx> = maxigji „} jx^ — 
We refrain from al so considering the Euclidean metric induced by the || • ||2-norm. which has been 



considered, e.g., by Slomczynski and Zyczkowski (2007) The reason is that this would turn the ILP 
formulation of the inverse power index problem into a binary non-linear programming one. This would 
be considerably harder to solve and add relatively little information because ||a;||oo < \\^\\2 v^ll^^lloo 
for all xeW. 

More interesting, in our view, is a variation of di which takes the Bernoulli model that underlies 
the PBI and Penrose's square root rule seriously. This model assumes that all bottom-tier voters in 
constituency i £ {!,... ,n} cast a "yes" or "no" vote equiprobably and independently of all others. 
The probability for one of pi individual voters in constituency i to be pivotal for the constituency's 
aggregate decision - i.e., to induce the i-delegate at the top-tier council to cast voting weight Wi in 
favor of "yes" by individually voting "yes", and "no" by voting "no" - is approximately \/2/(7rpi). 
The joint probability of a given voter being pivotal in his constituency i and of this constituency 
being pivotal at the top tier is hence Bi{v) ■ ^J2/ (-Kpi). This is why the square root PBI vector in 
equation ^ equalizes the indirect infiuencc of citizens on collective decisions across constituencies. If 
one now weights any deviation between the probability for a given voter in constituency i to be doubly 



^The expected value of the p-specific optimal quotas q*{p) for a particular (e.g., Dirichlet) distribution of p, of course 
need not coincide with the quota that is optimal when p is treated as a random v ariable. Stochastic optimization tech- 



niques are likely to yield a somewhat better q- heuristic than the one suggested by Slomczynski and Zyczkowski (2011) 

minimal integer representation of a weighted voting game has the additional advantage that the PBI and other 
power indices can be computed most efficiently. 



7 



pivotal and the egalitarian ideal of Pi ■ ■\/2/ [irpi) with fii = ^Jvlj 'YTj=\ \/p1 equally, then the total 
misrepresentation associated with the top-tier voting system v amounts to 

n n 

Y^P, ■ |A - B,{v)\ ■ V2/{ttp,) = c • ^ • - B,{v)\ 

i=l 4=1 

for c > 0. Whenever the desired vector /3 is derived from Penrose's square root rule and a vector p 
which represents EU population data, wc will, therefore, also consider the variation of metric di which 
weights absolute deviations by the square root of relative population, i.e., study the metric^ 



d'i{x,y) ^.h^^ \xt - yz\- 

i=i V ^j=iPi 



5 Computational results 

In this section we present our numerical results. Subsection 15.11 considers the EU Council of Ministers 
as a prototype of a real-world weighted voting system. We then look at the entire space of possible 
power distributions for n < 7 and a random sample thereof in Subsection 15.21 In order to study 
analytically how deviations between heuristics and actual optimization depend on ri, we investigate a 
particular parametric example in Subsection 15.31 



5.1 Examples of real-world weighted voting systems 

We consider the (EEC or EC or) EU Council of Ministers in the years 1958, 1973, 1981, 1986, 1995, 
2006, and 2011 with respectively n G {6, 9, 10, 12, 15, 25, 27} members. The historical population data 



for n G {6, . . . , 15} arc taken from Felsenthal and Machover (1998, sec. 5.3) the data for n e {25, 27} 



are official Eurostat figures downloaded on 19.01.2012. The desired power distribution /? is computed 
by Penrose's square root rule (sec equation 

In Tables we compare the three considered heuristics under different metrics with the optimal 
solution of the inverse power index problem, where we distinguish S, C, and W as the set of feasible 
voting structures. Besides the absolute deviations measured in the respective metric we also report a 
relative measure: if the deviation of a certain heuristic is given by 6 and that of the optimal solution 
V* € S is a - this is the unavoidable absolute "error" associated with the given instance of the inverse 
problem - then the tables report the avoidable error (6 — a) /a relative to global optimization in S 
(labeled S-error in the tables). A value of 1, e.g., means that the hcuristic's approximation error is 
twice the unavoidable one. The "f" -symbol indicates that the stated value is not computationally 
proven to be optimal: for simple games and n = 9, for instance, we stopped the ILP solution process 
after memory usage of 31 GB and 18461700 branch-and-bound nodes; for n ~ 10, we interrupted after 
301 GB and 16735508 nodes. Light figures for 5 or C represent lower bounds inferred from W, and 
"oo" entries indicate deviations by factors greater than 500. 

Independently of the chosen metricH the tables show: (i) the approximation errors of the heuristics 
and the optimal solutions in W (and a fortiori in C and S) tend to zero as n increases; (ii) except 
for n = 9, the q*- and the g-heuristics perform noticeably better than the 50%-heuristic; (iii) the q* 
and g-heuristics produce comparable errors for n < 15 but differ significantly for n > 15; (iv) the 
respective optimal weighted games v*** G W yield deviations that are only moderately higher than 
those of V* € S; and (v) the relative errors of the heuristics compared to either v* £ S or v** £ W are 
sizeable even for small n < 15 and become huge for n > 25. 

The last observation is probably the most interesting: whenever one seeks an optimal solution 
of the inverse power index problem, all three heuristics arc unsatisfactory from a pure operations 



^Consideration of a similar variation of doc broadly confirms the comparisons based on di, d'^^, and doo- 
*Notc that the three metrics behave differently when, e.g., distance between (1, 0, ... , 0) and (i, . . . , i) g A(n - 
is considered for increasing n. Deviations should, therefore, be compared only within and not across tables. 
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V* eS 


i;** e C 


V*** e W 


50%-hcuristic 


(7*-hcuristic 


g-heuristic 


^^ 


di 


di 


di 


di S 


-error 


di S 


-error 


di S 


-error 


6 


0.051857 


0.051857 


0.051857 


0.300398 


4.79 


0.091100 


0.76 


0.091100 


0.76 


9 


0.005294t 


0.008641 


0.010359 


0.065528 


11.38 


0.060195 


10.37 


0.069792 


12.18 


10 


0.002639t 


0.004840 


0.072186t 


0.038751 


13.68 


0.033229 


11.59 


0.026466 


9.03 


12 


0.0023441' 


0.002937^ 


0.005170^' 


0.028700 


11.24 


0.019827 


7.46 


0.019827 


7.46 


15 


0.000476 


0.000476 


0.000476t 


0.026742 


55.18 


0.006820 


13.33 


0.006361 


12.36 


25 


0.000000 


0.000000 


o.oooooot 


0.019422 


"oo" 


0.000744 


"CX)" 


0.003096 


"oo" 


27 


0.000000 


0.000000 


o.oooooot 


0.018003 


"oo" 


0.000633 


"C30" 


0.002457 


"oo" 



Table 2: Performance for Penrose square root targets in the di-metric (1958-2011 EU data) 



research perspective. The heuristic solutions can be improved by very large factors, and this becomes 
more rather than less pronounced as n grows. Of course, from an applied point of view the absolute 
approximation errors get so small for large n that they may be regarded as negligible. But they may 
still be relevant. In order to see what a deviation at the 5th decimal place means consider, e.g., the 
Penrose square root power distribution for the EU Council from 2011 and compare it to /3-^^' which 
would result if 50000 people moved from Germany to France or were mis-counted in the statistics. 
Then 11^27 _ ^27/||^ _ 0.0000634. 





V* eS 


V** e C 


V*** e W 


50%-heuristic 


g*-heuristic 


g-heuristic 


n 


d[ 


d[ 


d[ 


d[ S 


-error 


d[ S 


-error 


d[ S 


-error 


6 


0.018967 


0.021487 


0.021487 


0.110284 


4.81 


0.027465 


0.45 


0.027465 


0.45 


9 


0.001902t 


0.002752 


0.003513 


0.019015 


9.00 


0.018935 


8.96 


0.017643 


8.28 


10 


0.000803t 


0.001502t 


0.001909t 


0.008893 


10.07 


0.007325 


8.12 


0.005489 


5.84 


12 


0.000309t 


0.001164t 


o.ooosiot 


0.007840 


24.37 


0.004005 


11.96 


0.004005 


11.96 


15 


0.000152 


0.000152 


0.0001521' 


0.007790 


50.26 


0.001230 


7.09 


0.001554 


9.23 


25 


0.000000 


0.000000 


o.oooooot 


0.004874 


"oo" 


0.000213 


"oo" 


0.000751 


"oo" 


27 


0.000000 


0.000000 


O.oooooot 


0.004411 


"oo" 


0.000176 


"oo" 


0.000578 


"oo" 



Table 3: Performance for Penrose square root targets in the d'j^-metric (1958-2011 EU data) 





V* e S 


V** e C 


V*** e W 


50%-heuristic 


(j*-heuristic 


g-heuristic 


n 




doo 




doo S 


-error 


doo S 


-error 


doo S 


-error 


6 


0.014948 


0.014948 


0.014948 


0.082758 


4.54 


0.032728 


1.19 


0.032728 


1.19 


9 


0.001498^ 


0.001840 


0.002240 


0.019238 


11.84 


0.015909 


9.62 


0.023179 


14.47 


10 


0.000575t 


0.001500^ 


0.001960^ 


0.011574 


19.13 


0.006316 


9.98 


0.009721 


15.91 


12 


0.000229^ 


0.000580t 


0.000865t 


0.007940 


33.67 


0.005756 


24.13 


0.005756 


24.13 


15 


0.000066 


0.000066 


0.0000661 


0.005923 


88.74 


0.001798 


26.24 


0.001202 


17.21 


25 


0.000000 


0.000000 


O.oooooot 


0.003834 


"oo" 


0.000173 


"oo" 


0.000384 


"oo" 


27 


0.000000 


0.000000 


O.oooooot 


0.003434 


"oo" 


0.000156 


"oo" 


0.000277 


"oo" 



Table 4: Performance for Penrose square root targets in the doo-metric (1958-2011 EU data) 



5.2 Finite grid of objective vectors 

Every vector in R"q whose entries sum to 1 can, in principle, be a desired power distribution in a 
specific context. We approximate this infinite space by a discrete grid. We impose Pi > (32 > ■ ■ ■ > Pn 
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and let the desired power of the first n— 1 voters be an integr al multiple of s^O.OlU So given n and 
s, a finite set of desired power distributions arises, which we call grid points^^ Table [S] reports key 
statistics for the distribution of unavoidable deviations from the ideal vectors in the di and doo-metrics: 
its median, average, 10%, 5%, and 1%-percentile. The deviation figures are based on the respective 
exact solutions in W for n < 7 and approximations thereof for larger n. A number of grid points 
in parentheses indicates the size of the considered random sample whenever only a subset of all grid 
points could be dealt with. The deviation statistics in the corresponding row (in light color) involve 
a sample error in addition to the small error of using a conventional local search algorithm instead of 
global optimization in W. 





#grid 








d 


i-metric 
















metric 








n 


points 


med. 


av. 


10% 


5% 


1% 


med. 


av. 


10% 


5% 


1% 


2 


51 





240 





245 


0.040 





020 





000 





120 





123 





020 





010 





000 


3 


858 





240 





229 


0.100 





073 





027 





120 





115 





050 





037 





013 


4 


7519 





160 





162 


0.087 





067 





040 





070 





071 





040 





030 





017 


5 


41334 





100 





112 


0.060 





052 





033 





040 





042 





023 





020 





Oil 


6 


160668 





066 





077 


0.040 





036 





021 





022 





025 





013 





Oil 





008 


7 


477213 





041 





051 


0.026 





021 





017 





012 





015 





008 





007 





005 


8 


(10000) 





042 





046 


0.026 





023 





017 





012 





014 





007 





006 





005 


9 


(10000) 





032 





035 


0.020 





018 





014 





008 





010 





005 





004 





003 


10 


(10000) 





025 





027 


0.016 





014 





Oil 





006 





007 





004 





003 





003 


11 


(10000) 





020 





022 


0.013 





012 





009 





005 





005 





003 





002 





002 


12 


(10000) 





016 





018 


0.011 





009 





007 





004 





004 





002 





002 





001 


13 


(10000) 





013 





014 


0.008 





007 





006 





003 





003 





002 





001 





001 


14 


(10000) 





010 





Oil 


0.006 





006 





004 





002 





002 





001 





001 





001 


15 


(10000) 





008 





009 


0.005 





004 





003 





002 





002 





001 





001 





001 



Table 5: Distribution of unavoidable absolute deviations d\{(3,B{v*) and doo{l3, B{v*) 



Tables IHHH] report analogous statistics for the distribution of absolute distances for the three heuris- 
tics (considering each grid point for n < 11 and samples from the respective grid for n > 11). A 
comparison of the respective deviation statistics with those in Table [S] confirm the observations that 
were made for the very specific grid points derived from Penrose's square root rule in Section ISTTI the 
average and each reported percentile of the avoidable deviations decrease in n. They can be regarded 
as small in absolute terms, but they are sizeable in relative terms. Again the 50%-heuristic is clearly 
outperformed (in the sense of first order stochastic dominance) by the q* and g- heuristics for n > 3. 



5.3 Analytical example 

That relative deviations between the considered heuristics and globally optimal solutions need not 
disappear for n — >■ cx) can be seen very transparently by considering the desired power distribution 

n— 1 twos 

/3" :=-^(2r^,l) 

2n — 1 



^The desired power of the n-th voter is implied by the sum condition. 

^'^Step size s has to be chosen with care: the number of grid points can bo intractably great already for small n if s 
is too small. But a larger s induces a coarser grid of feasible target vectors. This becomes more and more problematic 
as n increases because of the corresponding natural decrease of an individual voter's relative power (on average equal to 
1/n). Choosing s = 0.03, for instance, would result in only 297 different grid points (i.e., distinct power distributions 
with /3i > /32 > • • ■ > /3n and /3i = kiS for ki G N) for n = 17 as opposed to 1297 points for n = 8. 
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#grid 








d 


1 -metric 














d 




metric 








n 


points 


med. 


av. 


10% 


5% 


1% 


med. 


av. 


10% 


5% 


1% 


2 


51 





480 





480 


0.080 





020 





000 





240 





240 





040 





010 





000 


3 


858 





563 





562 


0.200 





140 





060 





282 





281 





100 





070 





030 


4 


7519 





427 





512 


0.207 





160 





087 





210 





251 





087 





067 





037 


5 


41334 





340 





446 


0.165 





129 





077 





150 





208 





063 





049 





029 


6 


160668 





293 





383 


0.129 





105 





067 





117 





173 





045 





036 





023 


7 


477213 





240 





330 


0.102 





081 





053 





093 





146 





033 





026 





016 


8 


1145180 





200 





289 


0.080 





064 





043 





076 





127 





025 





019 





012 


9 


2320234 





170 





256 


0.066 





053 





036 





064 





112 





019 





015 





009 


10 


4094767 





147 





230 


0.056 





045 





031 





055 





101 





016 





012 





008 


11 


6449747 





129 





210 


0.049 





040 





028 





048 





092 





013 





010 





006 


12 


(100000) 





031 





036 


0.020 





018 





014 





008 





010 





004 





004 





003 


13 


(100000) 





026 





029 


0.017 





015 





012 





006 





008 





003 





003 





002 


14 


(100000) 





022 





024 


0.015 





013 





Oil 





005 





006 





003 





002 





002 


15 


(100000) 





019 





021 


0.013 





012 





010 





004 





005 





002 





002 





002 



Table 6: Distribution of absolute deviations for the 50%-lieuristic 





#grid 








d 


1 -metric 














d 




metric 








n 


points 


med. 


av. 


10% 


5% 


1% 


med. 


av. 


10% 


5% 


1% 


2 


51 





480 





480 


0.080 





020 





000 





240 





240 





040 





010 





000 


3 


858 





380 





425 


0.160 





107 





040 





190 





213 





080 





053 





020 


4 


7519 





340 





366 


0.160 





120 





060 





145 





170 





065 





050 





025 


5 


41334 





271 





306 


0.133 





107 





066 





110 





134 





052 





040 





024 


6 


160668 





220 





256 


0.108 





087 





058 





085 





108 





037 





030 





020 


7 


477213 





180 





216 


0.084 





069 





046 





065 





089 





026 





021 





014 


8 


1145180 





150 





183 


0.063 





051 





034 





053 





074 





019 





015 





010 


9 


2320234 





125 





157 


0.048 





038 





025 





043 





063 





013 





010 





006 


10 


4094767 





104 





137 


0.036 





028 





018 





035 





055 





010 





007 





004 


11 


6449747 





087 





121 


0.028 





021 





013 





029 





049 





007 





005 





003 


12 


(100000) 





017 





021 


0.009 





008 





006 





004 





005 





002 





002 





001 


13 


(100000) 





012 





015 


0.006 





005 





004 





003 





004 





001 





001 





001 


14 


(100000) 





008 





Oil 


0.004 





004 





003 





002 





003 





001 





001 





000 


15 


(100000) 





005 





008 


0.003 





003 





002 





001 





002 





001 





000 





000 



Table 7: Distribution of absolute deviations for the g*-heuristic 
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=//:Srid 

IT o 








d 


1 -metric 














d 




metric 








n 


points 


med. 


av. 


10% 


5% 


1% 


med. 


av. 


10% 


5% 


1% 


2 


51 





280 





327 


0.040 





020 





000 





140 





164 





020 





010 





000 


3 


858 





320 





334 


0.140 





100 





040 





160 





167 





070 





050 





020 


4 


7519 





300 





305 


0.150 





113 





060 





130 





138 





065 





050 





025 


5 


41334 





247 





261 


0.132 





101 





060 





100 





109 





050 





040 





023 


6 


160668 





200 





220 


0.103 





085 





056 





075 





086 





035 





028 





019 


7 


477213 





153 





181 


0.077 





063 





043 





055 





067 





024 





020 





013 


8 


1145180 





117 





148 


0.056 





046 





032 





040 





052 





016 





013 





009 


9 


2320234 





093 





125 


0.042 





034 





024 





030 





043 





012 





009 





006 


10 


4094767 





073 





106 


0.031 





025 





017 





023 





036 





008 





006 





004 


11 


6449747 





065 





094 


0.025 





020 





013 





021 





032 





006 





005 





003 


12 


(100000) 





017 





021 


0.009 





008 





006 





004 





005 





002 





002 





001 


13 


(100000) 





012 





015 


0.006 





006 





004 





003 





004 





001 





001 





001 


14 


(100000) 





008 





Oil 


0.004 





004 





003 





002 





002 





001 





001 





001 


15 


(100000) 





006 





008 


0.003 





003 





002 





001 





002 





001 





000 





000 



Table 8: Distribution of absolute deviations for the g-heuristic 



for n > 20 For any quota q € l{ := • (2j - 1, 2j], where 1 < j < n - 1 and j G N, the PBI of 
the smallest constituency is exactly zero and, by symmetry, the PBI of the other constituencies equals 
^^^Tj-. For the remaining possibilities q € I2 '■= 2n-i ' ^-^ where < j < ?i — 1, all constituencies 
have a PBI of i. Denoting the corresponding weighted games by v" ^ and V2 j one obtains 

di{vl,,r) 



2n-l' 

2 71-1 



2n — 1 n 
doo«,,/3") = T^r^' and 



2n- 1 

1 n-1 



2n — 1 n 



So independently of the quota the H ■ ||i-crror is 2n-i '^0{ri ^) and the || • ||oo-error is 2n-i 0{n ■^j. 
The q* and g-heuristics prescribe quotas of 

1 1 

9=2+ 



, 1 , V4^r^ 

^ =2+^^73^' 

respectively. They and q° = 50% fall into If and /| for infinitely many n. Thus, all three rules 
render the smallest constituency a null player infinitely many times as n 00, just as it happened to 
Luxembourg in the EEC Council between 1958 and 1973. 

In contrast, there is always a simple game whose PBI attains /3" exactly for 6 < n < 13. And we 
conjecture that this remains true for n > 14. Approximation results for complete simple games and 
the heuristic choice oi w ~ with an "optimal" quota q that leads to V2 j (abbreviated as q-heuristic) 
are summarized in Tables |9] and 1101 Since the unavoidable error in the class of simple games S (and 
hence of finite intersections of weighted games w G W) is zero for 6 < n < 13 and presumably beyond, 



^^The construction is inspired by a sequence of weighted voting games to which the Penrose limit theorem (see fn. [3t 
does not apply. 
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V* 


V** e C 


V 


*** e w 




g-heuristic 


n 




di 




di 




di 




di 


C-error 


2 





333333 





333333 





333333 





333333 








3 





266667 





266667 





266667 





266667 





000000 


4 





214286 





214286 





214286 





214286 





000000 


5 





038647 





158730 





158730 





177778 





120000 


6 





000000 





113636 





113636 





151515 





333333 


7 





000000 





085470 





085470 





131868 





542857 


8 





000000 





066667 





066667 





116667 





750000 


9 





000000 





064171 





064171 





104575 





629630 


10 





000000 





061042 





061042 





094737 





552000 


11 





000000 





052158 





052158 





086580 





659944 


12 





000000 





047254 





047254 





079710 





686856 


13 





000000 





042353^ 





044483^ 





073846 





743590 



Table 9: Deviations from /3" in the di-metric (analytical example) 



we consider the C-error in order to evaluate the relative performance of the g-heuristic. Interestingly, 
the C-error in the di-metric seems to converge to a constant while it seems to grow without bound for 
the doo-metric. 





V* €S 


V** e C 


V 


*** e yv 




g-heuristic 


n 












doo 




doo 


C-error 


2 





166667 





166667 





166667 





166667 





000000 


3 





133333 





133333 





133333 





133333 





000000 


4 





107143 





107143 





107143 





107143 





000000 


5 





019324 





050505 





050505 





088889 





760000 


6 





000000 





034759 





034759 





075758 


1 


179487 


7 





000000 





022624 





022624 





065934 


1 


914286 


8 





000000 





015686 





015686 





058333 


2 


718750 


9 





000000 





014199 





014199 





052288 


2 


682540 


10 





000000 





008772 





008772 





047368 


4 


400000 


11 





000000 





008282 





008282 





043290 


4 


227273 


12 





000000 





007688 





007688 





039855 


4 


183908 


13 





000000 





005373^ 





007083^ 





036923 


5 


871952 



Table 10: Deviations from /?" in the doo-metric (analytical example) 



6 Conclusion 

The computations which we have reported in Section 15.11 confirm that if one wants to implement 
the Penrose square root rule for population data from today's European Union, the (7*-hcuristic of 
Slomczyhski and Zyczkowski and, to a lesser extent, the even simpler heuristic perform very well in 
absolute terms. That is, the distance between a (normalized) square root target distribution /3 and 
the PBl B{q*,(3) is close to zero. However, the considered heuristics can still be very far from the 
globally optimal solution to the inverse problem in relative terms. This finding applies even when only 
weighted voting games are allowed as feasible solutions. And it is not restricted to small voting bodies, 
but holds for the current number of EU members n = 27. 

The extensive computations reported in Section 15.21 confirm this observation. They provide the 
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first systematic evaluation of tlie unavoidable deviations between arbitrary target PBI power vectors 
and those that are actually implementable for voting bodies with up to n = 15 members. Numbers 
such as the ones reported in Table [S] can potentially be useful in order to improve termination criteria 
for local search algorithms (e.g., Leech [201^1 12003]) . which have been used in applied studies. If, say, a 
locally optimal candidate solution for an inverse problem with n = 11 voters has a di-deviation from 
the desired vector (3 greater than 0.020, then Table [5] indicates that the odds of further improvements 
in the class of weighted voting games are 50:50 and search presumably should continue in a different 
part of the game space. If, however, the deviation is smaller than 0.009, then the odds are rather 1:99; 
termination might then make sense. 

That desired PBI distributions which concentrate a major share of relative power amongst a few 
vo ters pose problems for the considere d heuristics is not surprising. After all, the derivation of q* 
by Slomczyhski and Zyczkowski (2007) supposed a technical condition (see fn. |4]) from which one can 
conclude Wj = fij G 0{\/^Jn)^ i.e., the PBI of a single voter should approach zero as least as fast as 
1/y/n. It is much less obvious, however, that (i) it is not sufficient to have a target vector j3 without 
"outliers" in order to obtain a heuristic solution that is good relative to the exact one and (ii) the 
relative errors may get larger rather than smaller as n increases. This emerged from the extensive 
numerical computations reported in Sections I5.1H5.2I and has also been demonstrated for a specific 
analytical example in Section 15.31 One might, therefore, summarize our findings as justifying and 
potentially even calling for case-specific optimization rather than the application of a generally rather 
good heuristic - not only for small but even for large voting bodies. 
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Appendix: ILP formulation for the inverse Penrose-Banzhaf 
index problem 

The following ILP formulation considers the inverse problem on the class of simple games S and for the 
di-mctrie. Adaptations to C or W and d'l or doo involve further variables and (modified) constraints, 
but are otherwise very similar: 



xs e {0,1} 


\fSCN, 


(3) 


Xs < XT 




(4) 


X0 — u 






XN = 1 




(6) 


y.,s e {0, 1} 


VI < i < n,5 C N\{i}, 


(7) 


yi,S = Xs\j{i} - xs 


yi<i<n,S C N\{i}, 


(8) 


>0 


yi < i < n, 


(9) 




\/l<i<n, 


(10) 


^ — ^l—l 




(11) 


s, >o 


yi <i <n, 


(12) 


Si > Si - Pi- s 


yi <i <n, 


(13) 


Si > -Si + Pi - s 


yi < i < n, 


(14) 


> 5i < a ■ s. 
^ — ^i—i 




(15) 



The binary variables xs define a Boolean function v via v{S) = xs', inequalities ([3])-® ensure that 
they represent a simple game. The binary auxiliary variables yi^s = Xsu{i} — xs which are introduced 
in ([T])-® for alH G A'' and C C -/V\{z} satisfy yi^s = 1 if and only if coalition 5 is a swing for 
voter i. With this the number of swings Si for each player i is determined in equality ()10p . Since the 
PBI for voter i is given by Bi{v) ~ Si/^^^^ sj, the objective is to minimize the c?i-distance 



E 



En 
.7 = 1 ' 



Unfortunately, the quotient cannot be linearized. We, therefore, introduce s = X]r=i inequality 
pT|) and capture Si > \si — (3i ■ s\ by inequalities and p^ Pn Instead of directly minimizing the 
sum of all Si we introduce the constraint (fT5|) for a constant a E [0,2]. Here, a = 2 reflects the 
supremum of di-distances between elements of A{n — 1), and a = corresponds to identity of B{v) 
and desired vector /3. 

It remains to note that if the ILP ([5|)- (fT5|) has a solution, then the corresponding simple game v 
approximates the desired power distribution /3 with an error of at most a in the || • ||i-norm. Otherwise, 
no such approximation is possible. We can hence minimize the deviation by performing bisections on a. 
Since s lies between n and m{^^ < n2" where m = [^J + 1 (see, e.g.. IFelsenthal and Machover 19981 

sec. 3.3) two distinct PBI vectors differ, both in the di- and the doo-metric, by at least (;;^)^. We 
hence only need 0{n) bisections on a. The computations were been carried out using the Gurobi 4.6 
and the IBM ILOG CPLEX 12.4 software packages. 



^■^Interestingly, one can easily linearize the analogous inverse problem for the Shapley-Shubik power index (SSI). So 
even though the PBI is easier to compute than the SSI, the corresponding inverse problem is slightly more difficult. 
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